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ON THETA-PALINDROMIC RICHNESS 
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Abstract. In this paper we study generalization of the reversal mapping realized by an arbitrary 
involutory antimorphisin 0. It generalizes the notion of a palindrome into a 0-palindrome - a word 
invariant under ©. For languages closed under © we give the relation between ©-palindromic complexity 
and factor complexity. We generalize the notion of richness to ©-richness and we prove analogous 
characterizations of words that are 0-rich, especially in the case of set of factors invariant under ©. A 
criterion for ©-richness of 0-episturmian words is given together with other examples of 0-rich words. 
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^ , 1. Introduction 



A palindrome is a word that reads the same from the left as from the right. In the last decade the 

study of palindromes in the field of combinatorics on words has notably grown. The motivation comes 

\^ • for instance from the application in mathematical physics or diophantine approximation. Two notions 

r-| i were mainly studied. First the palindromic complexity which counts the number of palindromes of 

given length in an infinite word. Its study resulted for example in a new equivalent definition of Sturmian 

words using palindromic complexity ([12]). The second notion is the notion of defect which measures 

the saturation by palindromic factors of a finite word ([6]) . An important class of words that are "rich" 

in palindromes in the utmost sense (called rich words) have been defined in [11, 13[. They have been 

04 ■ further studied and many characterizations have been found. For recent results and summary on rich 

K* I words see for instance [9, 10, 13[. 

A more general approach to the notion of palindrome has been considered by replacing the reversal 
mapping by a mapping having the needed properties, i.e., being an involution and antimorphism. A 



(•^ ■ word which is invariant under such mapping is called a pseudopalindrome or 6-palindrome. See 



for instance [2] or [7]. In this work we concentrate on 0-palidromic saturation in infinite words. We 
f~^ • derive an upper bound for the number of distinct 0-palindromic factors in a finite word. We define an 

(^ I analogue to rich words - words containing the maximum number of 8-palindromes possible. For this 

class of words that are rich in 0-palindromes (or shortly 0-rich) we prove similar equivalent statements 

as are known for rich words. The main result concerns words with set of factors closed under Q. We 
. , ■ prove the following relation between 0-palindromic complexity 'PQ{n) and factor complexity C{n) which 

r> \ is analogous to the case of the reversal mapping ([3]). VQ{n) denotes the function counting the number 

jrt ' of distinct 0-palindromic factors of length n in a given word and C{n) denotes the function counting the 

number of distinct factors of length n in a given word. 

Theorem 1. Let u he an infinite word with language closed under Q. Then 

Vein) + Vein + 1) < C{n + 1) - C{n) + 2, for all n £ N. 

We show that the equality is attained for 0-rich words, a result analogous to the case of the reversal 
mapping in [9[. 

Theorem 2. Let u be an infinite word with language closed under Q. Then u is Q-rich if and only if 
Vein) + Vein + 1) = Cin + 1) - C{n) + 2 for all n>l. 

We also make a connection to 0-episturmian words that are a generalization of episturmian words - 
a well known example of rich words. For a survey on such class of words see for instance [8]. Unlike 
episturmian words, 0-episturmian words are not all 0-rich. We give a condition when this is true. 
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2 STEPAN STAROSTA 

Section 2 summarizes notation and introduces some needed results from combinatorics on words. It 
includes known results on rich words and defines 0-palindromes. Section 3 establishes a relation between 
0-palindromic and factor complexity. Next section introduces the definition of ©-richness and a proof of 
Theorem 2. Section 5 provides examples of 0-rich words and contains also examples of 0-episturmian 
words which are not ©-rich. 

2. Notation and basic facts 

By A we denote an alphabet - a finite set of symbols, called letters. A finite sequence of letters 
w = W0W1W2 ■ ■ ■ Wn~i is called a word. Its length is denoted by \w\ and equals n. The set of all words 
over an alphabet A, together with the empty word e and the operation of concatenation, form a free 
monoid A*. 

The reversal of a finite word w — WqWi . . .Wn-i is the word w — u'„_iw„_2 • • -Wq. The mapping 
w I— > w is called reversal and clearly is a bijection on A*, li w — w then such word w is called a 
palindrome. 

An infinite sequence u — U0U1U2 ... of letters from A is said to be an infinite word over A. A finite 
word V is called a factor of a finite word w if there exist words v' and v" such that w — v'vv" . If v' ~ s 
then V is called a prefix oi w. If v" ~ e then v is called a suffix. A factor of an infinite word u is defined 
analogously with v" being infinite. 

The language C{\i) of an infinite word u is the set of all its factors. £„(u) denotes the set of factors of 
u of length n. We have £o(u) — {e} and C{vl) = lj^gpj£„(u). If for every factor w G C{u.) the reversal 
image of w is in >C(u), then we say that the language C{\i) is closed under reversal. 

For any factor w G /^{^) there exists an index i such that w — UiUi-^-i . . . 'Ui_|_|^|_i. Such an index is 
called an occurrence of w in u. We will say that a factor v is unioccurent in some word w ii v occurs 
exactly once in w. If every factor of u has at least 2 occurrences in u then the word u is said to be 
recurrent. One can verify that this is equivalent with every factor occurring infinitely many times. For a 
recurrent infinite word we define a complete return word^ to a factor w G C{u.) to be a factor with exactly 
2 distinct occurrences of w, one as a prefix and one as a suffix. 

A factor w G C{\i) is said to be left (right) special if there exist two distinct letters a, fe G ^ such that 
aw G C{x\) and bw G C{'a) {wa G C{vl) and wb G C{\i)). A factor is bispecial if it is right and left special. 

The factor complexity of an infinite word u is a mapping N 1— > N defined as C{n) := =ffCn{u). The 
palindromic complexity is a mapping N 1— >■ N defined as Vin) :— ^ {w G Cn{u) \ w = w}. 

In [3] the following connection between palindromic complexity and the first difference of factor com- 
plexity AC(n) — C{n + 1) — C{n) was proved. 

Theorem 3 ([3]). Let u be an infinite word with language closed under reversal. Then for all n we have 
(1) V{n)+Vin+l)<AC{n) + 2. 

Given an infinite word u and n G N, we define the Rauzy graph of order n of the word u, denoted 
r„(u), as follows. r„(u) is a directed graph whose set of vertices equals £„(u) and set of edges equals 
>C„4-i(u). There is an edge e G £„-)-i(u) going from the vertex vi to the vertex V2 if fi is a prefix of e and 
V2 is a suffix of e. If the word u is recurrent then for all n the graph r„(u) is strongly connected, i.e., 
there exists a directed path from any vertex to any other vertex. Let deg_|_(w) denote the outdegree of 
a vertex w. Especially for r„(u) we have deg^_(w) = # {a G .4 | wa G £(u)}. Rauzy graphs are a useful 
tool to calculate the complexity as we have 

AC{n)^ Y, (dcg+H-1). 

An n-simple path p is a factor of u of length at least n -\- 1 such that its prefix of length n and its 
suffix of length n are both special (left or right) and no interior factor of length n is special. We can now 
define the reduced Rauzy graph of order n of u r^(u). Its set of vertices equals the set of special (right 
or left) factors of >C„(u). There is an edge e from vi to V2 if there is a n-simple path p which begins with 



A shorter term "complete return" is also used. 
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vi and ends with V2- Note that if for a given length A^ there is no special factor, the word u is periodic, 
i.e., can be written as u = u" = vvv . . ., where w is a finite word. In such cases the definition of reduced 
Rauzy graph of order n results in an empty graph for all n > N. 

Consider now an infinite word u with language closed under reversal. We can define the super reduced 
Rauzy graph of order nofu denoted T'^(u) (first defined in [9]). Its set of vertices is formed by unordered 
pairs {w,w}, where w £ Cn{u) is a special factor. There is an undirected edge {e,e} from {v,v} to 
{w,w} if there is an n-simple path e starting with v otv and ending with w or w. Note that r"(u) may 
have multiple edges or loops. 

2.1. Palindromic richness. We say that a finite word w is rich if it has exactly \w\ + 1 palindromic 
factors. It is in fact the maximum number of palindromic factors possible. An infinite word is rich if all 
its factors are rich. This definition arises from the work done in [11] and in [13]. 

The following theorem summarizes some known characterizations of rich words. Listed equivalent 
definitions of richness have been proven respectively in [13], [9], ]10]. 

Theorem 4 (]13], ]9], ]10]). For any infinite word u the following conditions are equivalent: 

(1) u is rich, 

(2) any complete return word to a palindromic factor of u is a palindrome, 

(3) for any factor w of u, every factor of u that contains w only as its prefix and w only as its suffix 
is a palindrome, 

(4-) each factor ofu is uniquely determined by its longest palindromic prefix and its longest palindromic 
suffix. 

The following theorem gives a characterization of rich infinite words with language closed under rever- 
sal. It also answers the question whether the inequality (1) in Theorem 3 is reached for rich words. 

Theorem 5 (]9]). Let u be an infinite word with language C{u) closed under reversal. Then u is rich if 
and only ifv\n + 1) + V{n) = AC(n) + 2 for all n G N. 

In ]4] a different characterization of rich words with language closed under reversal has been shown. 
It is based on bispecial orders of factors. 

Let us mention two classes of words that we will refer to later and that are also rich. The well-known 
Sturmian words are exactly those words with factor complexity C{n) = n -I- 1 for all n S N. The second 
class consists of so-called episturmian words. They are defined as words whose language is closed under 
reversal and have at most one left special factor of every length. 

For further information on episturmian or rich words you can refer for instance to [5, 11, 13, 14]. 
A differently constructed class of rich words than the two mentioned above may be found in [1[. 

2.2. 0-palindromicity. The reversal mapping can in fact be understood as a special case of a more 
general mapping 9; A* ^^ A* which satisfies two conditions: 

(1) it is an involution, i.e., O^ = id; 

(2) O is an antimorphism, i.e., yv,w G A*, Q{vw) = 0(w)0{v). 

From now on, O will denote a mapping with such properties. It is easy to verify that O can only be a 
composition of the reversal mapping and an involutory permutation of letters. Such permutation has two 
types of cycles distinguished by their length 1 or 2. 

The definitions in the previous section can be modified in the following way. A finite word w is called 
a Q -palindrome if w = 0(w). O -palindromic complexity is a mapping N h- > N defined as 

VeH ■■=i^{we £„(u) I w =. e{w)} . 

The language of u is closed under 6 if for any factor w G C{u) we have Q{w) G C{u}. The definition of 
super reduced Rauzy graph is analogous: for words closed under @ the reversal mapping is replaced by 

e. 

As we will see later, a crucial difference between the reversal mapping and a O mapping different from 
the reversal mapping is that there exists a letter a such that a ^ 0(a). Therefore there exist words that 
have no non-empty 0-palindromic suffix at all. 
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The research on O-pahndromicity has led to several generalizations of known results. We will recall 
one such generalization. A word is G-episturmian if its language is closed under Q and if for each n 
there exists at most one left special factor of length n. Such words are discussed for instance in [8]. In [2] 
these words are called pseudopalindromic. A more general class that we will mention later is discussed 
in [7]. 

3. 9-PALINDROMIC AND FACTOR COMPLEXITY 

As for the reversal mapping, if u has its language closed under Q, then it is recurrent. To see that, 
consider a non-0-palindromic factor w 6 'C(u). As £(u) is closed under Q we can find an occurrence of 
0{w) and so find a factor r having w as a prefix (respectively suffix) and Q{w) as a suffix (respectively 
prefix). If r is not a 0-palindrome, then we can find a different occurrence of Q{r) and in it a second 
occurrence of w. If w or r is a 0-palindrome it suffices to consider a non-0-palindromic factor that 
contains w or r as factor. Such factor either exists (for instance one can look only among prefixes of u) 
or u is trivial (periodic with period 1). 

The following observation, stated as a lemma, will be used in the proof of Theorem I. We provide a 
short proof. 

Lemma 6. Let u be an infinite word with language closed under O. If there exists an integer N such 
that there is no special factor of length N , then for all n > N we have 'PQ{n) + 'PQ{n + 1) =2. 

Proof. As u is recurrent, it is clear that for all n> N , r„(u) is a cycle. Take n > N. We will consider 
an undirected graph G whose vertices are defined by unordered pairs {w, Q{w)} for all w S i3„(u). Let e 
be an edge of r„(u) going from the vertex w to the vertex v. For each pair {e, 0(e)}, there is an edge in 
G between the vertex {w, Q{w)} and the vertex {v, Q{v)}. Note that there may be multiple edges and 
also loops (in the case e = 0(e)). According to the terminology already used this graph may be called 
"super Rauzy graph of order n". 

As r„(u) is a cycle, it is clear that G is formed by one elementary path p and at most 2 loops - 
one possibly on the first vertex of p, one possibly on the last vertex of p. Suppose there is an interior 
vertex {w, Q(w)} in p such that w — Q{w). Then w would be a special factor of length n and we have a 
contradiction. Denote by {s, 0(s)} the first vertex of p. If there is a loop on {s, 0(s)}, then it is clear 
that s 7^ 0(s) as otherwise s would be special. On the other hand, if there is not a loop on {s, 0(s)}, 
then s ~ 0(s) as r„(u) would be disconnected. The reasoning is analogous for the last vertex of p. 

As a loop in G is in fact a 0-palindrome of length n + 1, we have at most 2 0-palindromes of length 
n or n + 1 - one associated with the first vertex of p and one with the last vertex of p. 

D 

We will now prove the analogue of Theorem 3 for languages closed under Q. 

Proof of Theorem 1. Consider the operation 9 which to every vertex w of a Rauzy graph associates Q{w) 
and to every edge e associates 0(e). Since we supposed the language closed under 0, is well defined 
on all Rauzy graphs of u. In fact, 6 maps every Rauzy graph r„(u) onto itself. We will omit u in the 
notation of Rauzy graphs from now on. 

Fix n. We are interested in n-simple paths. As -C(u) is closed under 0, u is recurrent. Thus the graph 
r„ is strongly connected and every vertex and every edge belongs to an n-simple path. 

For an edge satisfying 9{e) — e we find the n-simple path p which contains e. Then the operation 9 
must map the path represented by p onto itself, i.e., p = 0(p)- Similarly, for a non-special vertex w such 
that 9{w) — w, the n-simple path containing w is mapped by 9 onto itself. 

Note that an n-simple path can contain at most one 0-palindrome of length n or n + 1 which is not its 
prefix or suffix. To prove that suppose the contrary. Let z be an n-simple path containing two interior 0- 
palindromic factors of length n or n-l- 1. Denote pi the 0-palindrome with leftmost occurrence in z which 
is not a prefix of z. Since there is another 0-palindrome of length n or n + 1 and z is a 0-palindrome, 
there is another occurrence of pi as an interior factor of z. Let r be the shortest left special factor such 
that pi is its suffix. If there is no occurrence of r in z, then there is another occurrence in z of the prefix 
of z of length n which is neither its prefix nor suffix. If there is at least one occurrence of r in z, then 
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either the prefix of length n of r is a special factor of z different from its prefix or suffix, or again the 
prefix of z of length n has another occurrence in z different from its prefix or suffix. In both cases, we 
can find an interior occurrence of a special factor of length n - a contradiction with z being an n-simple 
path. 

To give an upper bound on VQiji) + 1^0(71 + 1) therefore consists in finding the number of n-simple 
paths in r„ that are mapped by 9 onto themselves and the number of special 0-palindromes of length n. 
For that we can consider the reduced Rauzy graph F^. 

The set of vertices of F^ can be partitioned into two disjoint subsets. The first subset is given by special 
0-palindromes of length n. Let a denote the number of such vertices. The second subset is formed by 
the rest of the vertices - non-0-palindromic special factors of length n. Let the number of such vertices 
be denoted by 2/3. The total number of vertices in F^ is then a + 2/3. 

It is clear that the super reduced Rauzy graph F" has exactly a + /3 vertices. As F^ is connected, 
it has at least a + /3 — 1 edges. Thus the number of edges in T'^ is at least 2{a + /3 — 1). These edges 
correspond to the n-simple paths in F„ which are not mapped by 9 onto themselves. 

As already mentioned, the number of 0-palindromes of length n or n + 1 is bounded by the number 
of n-simple paths in F„ which are mapped onto themselves, and the number of special 0-palindromic 
vertices. We thus have 

Vein) + Vein + !)< ^ deg+(w) - 2(a + /3 - 1) + a, 

w is special 

where the first summand is the number of all n-simple paths in F„, the second summand estimates the 
number of n-simple paths which are not mapped onto themselves, and the third counts the number of 
special 0-palindromic vertices. We obtain 

Vein) + Vein + I) < J2 deg+(w) - (a + 2/3) + 2 = 

wGCn{u.) is special 

(2) = E (deg+H-l)+2 = 

w^Cn{u) is special 

AC(n) + 2. 

The proof is finished except for the special case where the word u is periodic. This implies existence 
of N such that for all n > iV we have AC in) ~ 0, i.e., there is no special factor of length n. It this case 
the reduced Rauzy graph of order n is empty. According to Lemma 6 we have Vein) + Vein + 1) = 2 
for all n > A^, which completes the proof. 

n 

Corollary 7. Let £(u) be closed under O. Let n > 0. The equality 

(3) Vein) + Vein + 1) = AC(n) + 2 
holds if and only if both of the following conditions are satisfied: 

(1) the graph F" after removing loops is a tree, 

(2) any simple path forming a loop in the graph F" is mapped by onto itself. 

Proof. First, we rewrite the equality in the relation (2) as 

AC(n) + 2 = #n-simple paths in F^^ - a - 2/3 + 2 

where a is again the number of special 0-palindromes of length n and 2/3 is the number of non-0- 
palindroniic special factors of length n. Let p denote the number of n-simple paths that are mapped by 
9 onto themselves and m the number of n-simple paths that are not mapped onto themselves. 
We will prove the equivalence directly. Suppose that Vein) + Vein + 1) = AC(n) -I- 2. We have 

a+p = Vein)+Vein+l) = AC(n) + 2 = p + m - a - 2/3 + 2. 

Therefore m, = 2(Q! + /3 — 1). From the proof of Theorem 1, this is equivalent with having exactly the 
minimum number of non-0-palindromic n-simple paths needed so that the graph F'^ is strongly connected. 
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As loops are not needed to have the graph strongly connected, they are counted in the number p. These 
two facts are equivalent with F" being a tree after removing loops and all loops being 0-palindromic. 

D 

A natural question is whether there exist such words for which the equality (3) is satisfied for all n. 
The answer is positive only for Q equal to the reversal mapping. In any other case we have T'eCl) < i^-^ 
and thus 

Pe(0) + Veil) < (#^ - 1) + 2 = AC(0) + 2. 

However, there are words saturated in 0-palindromes that satisfy the equality for all n > 1. Examples 
of such words are given in Section 5.1. 

4. 6-RlCHNESS 

Recall that a finite word is rich if it has the maximum possible number of palindromic factors. If we 
want to use the same notion of richness for 0-palindromes, we need to take into consideration that there 
is always fewer 0-palindromes since some letters may not be 0-palindromes. To define richness we need 
to evaluate the number of these letters contained in a given word. We introduce the set j{w) as follows 

j{w) := {{a, e(a)} \ae A,aj^ e{a),a or e(a) £ C{w)} . 

It is clear that if w' is a prefix of some finite word w, then ij=^{w') < ^"/(w). It is also clear that if for 
some letter a; G ^ we have j{w) ^ j{wx), then the word wx has no 0-palindromic suffix as 0(x) does 
not occur in it. Also, as ^{wx) — 7(w)u{ {x, Q{x)} }, one can see that in this case ^j{wx) — #7(w) + l. 

The following proposition gives the maximum number of 0-palindromes that may occur in a finite 
word. 

Proposition 8. Let w be a finite word. Then the number of its Q -palindromic factors is bounded by 

Proof. The proof is done by induction on |w|. The statement is clear for \w\ < 1. Suppose that for 
\w\ < N, w contains at most |w| + 1 — ^j{w) different G-palindromes. Let x ^ A. We will count the 
number of 0-palindromes in the word wx. It is clear that the number of 0-palindromes in wx is equal 
to the number of 0-palindromes in w plus the number of new 0-palindromic factors - 0-palindromic 
suffixes of wx that do not occur in w. 

First suppose we have j{w) ^ "f{wx). As there is no 0-palindromic suffix the number of 0-palindromes 
in wx equals the number of 0-palindromes in w which is at most |w| + 1 — #7(w) = \wx\ + 1 — ^"f{wx). 

Suppose "f{wx) — "/(w). Suppose for contradiction that wx contains more than \wx\ + 1 — f/=j{wx) 
different 0-palindromes. As \wx\ + 1 — =ff^j(wx) = {\w\ + 1 — #7(w)) + 1, it is clear that there must be 
at least two different 0-palindromes p and q such that w = yp and w — zq that do not occur previously 
in w (as they were not yet counted). We can assume that \p\ > \q\ and p is the longest of such suffixes 
of wx. On the other hand we have p = vq — qO{v) where \v\ > 0. Therefore every such factor q occurs 
already in w and has already been counted in the number of 0-palindromes in w - a, contradiction. We 
can see that the number of 0-palindromes in wx is equal to either the number of 0-palindromes in w 
or the number of 0-palindromes in w plus 1. Therefore the upper bound is juij + 1 — 4j=^{w) + 1 which 
equals \wx\ + 1 — ^"f{wx). D 

We can see from the preceding proof that if there is a new 0-palindromic factor, it is the longest 
0-palindromic suffix. We will now define a class of words having maximum number of 0-palindromes. 

Definition 9. A finite word w is 0-rich if it contains |w| + 1 — ij=^{w) 0-palindromic factors. 
An infinite word is 0-rich if all its factors are 0-rich. 

One can see that all factors of a finite G-rich word are 9-rich as well. Thus, if an infinite word has all 
its prefixes 0-rich, then all its factors are 0-rich and it is by definition 0-rich. 

The following equivalence is a consequence of the last remark and of the proof of Proposition 8. It is 
a 0-rich analogue of results in [11, 13]. 
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Proposition 10. An infinite or finite word u is Q-rich if and only if the longest Q-palindromic suffix of 
each prefix p is unioccurrent except for prefixes having the form px, with x G A and "/{p) =/= "/(px). 

Proposition 10 leads to the following corollary, which is a 8-rich analogue of a result in [9]. 

Corollary 11. Let u be an infinite 0-rich word. Then for all w G 'C(u), w ^ Q{w), the occurrences of 
w and Q{w) alternate. 

Proof. Suppose there exist a factor w e -^Cu), w ^ 0(w), and a factor v G 'C(u), |w| > \w\, such that 
w is its prefix and its suffix and there is no occurrence of @{w) in v. Since the last letter of v already 
occurred in v and v does not have a unioccurrent O-palindromic suffix, according to Proposition 10 we 
have a contradiction. D 

The following proposition gives an equivalent characterization of 0-rich words. 

Proposition 12. Let u be an infinite or finite word. Then u is rich Q-rich if and only if for each factor 
V ^ e of u, any factor of u beginning with v and ending with 0{v), containing no other occurrences of v 
or &{v), is a O -palindrome; and for all a Cz A, the occurrences of a and 0(a) in u alternate. 

Proof. (=>): Let w be a factor of u. Consider a factor r containing w as a prefix and 0(w) as a suffix, 
with no other occurrences of v or Q{v). 

If \v\ = |r|, then v ~ Q(v) and r is a 8-palindrome. 

Let us assume \v\ < \r\. Let s be the longest 0-palindromic suffix of r. (There is such a suffix as r is 
0-rich and the image by 8 of the last letter of r has already occurred in r as the first letter.) 

If \s\ < \v\ then we can write 8(f) = zs with z ^ A* . Then v ~ 8(s)8(z) ~ sQ(z). We have a second 
occurrence of s in r - a contradiction with its unioccurrence. 

If \v\ < \s\ < \r\ then as 8(f) is a suffix of s we can write s ~ zQ{v) — vO(z). Therefore we can find 
a second occurrence of f in r - a contradiction with choice of r. 

The last possibility left is s = r and thus r is a 8-palindrome. 

Alternation of a and 8(a) follows from Corollary 11. 

(<^): Suppose for contradiction that u is not 8-rich. Then there is a prefix p whose longest 8- 
palindromic suffix s occurs at least twice in p or there is a prefix p with no non-empty 8-palindromic 
suffix such that p ends in a non-8-palindroinic letter a and has at least one another occurrence of a or 

e(a). 

In the first case, as s occurs in p for a second time, there is a complete return word to s contained in 
p which is its suffix. The complete return word is a 8-palindrome and is longer than s - a contradiction. 

In the second case, as occurrences of a and 8(a) alternate, we can find the rightmost occurrence of 
8(a) in p. The suffix starting at that occurrence is then a word beginning with 8(a), ending with a and 
with no other occurrences of a or 8(a). We supposed such words were 8-palindromic. Thus p has a 
8-palindromic suffix and we have a contradiction. 

D 

In the last proof we did not need all the assumptions to prove the second implication. We can formulate 
a weaker sufficient condition as follows. 

Proposition 13. Let u be an infinite or finite word satisfying 

• all complete return words to O -palindromes are Q -palindromes; 

• for any a G A, a ^ 8(a), occurrences of a and 8(a) alternate; 

• factors beginning with a G A and ending with 8(a), with no other occurrences of a or 8(a), are 
Q -palindromes. 

Then u is Q-rich. 

The following lemma will be used in the proof of Theorem 2. 

Lemma 14. Let u be an infinite word with language closed under 8. If Vq{1) + 7^0(2) = AC(1) + 2 then 
for all letters a, a ^ 8(a), occurrences of a and 8(a) alternate. 
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Proof. Let a € A and suppose a is not a O-palindrome. We will look at the super reduced Rauzy graph 
r" for which the statements in Corollary 7 hold. 

Suppose a is not special. Then a is a factor of exactly one l-simple path p. If the edge {p, 6(p)} is 
a loop in F" then the occurrences of a and Q{a) must alternate && p — <d{p). If {p, 0(p)} is not a loop 
then there is no path in F" that would contain 2 occurrences of p and no occurrence of 0(p). Such path 
would form a cycle in F" which is not possible. 

Suppose now that a is special. Then {a,<d{ay\ is a vertex of F'/. We will look at paths going from 
{a, 6(a)} again to {a, 0(a)} which are not going through the vertex {a, 0(a)}. There are 2 possibilities. 

In the first case such path corresponds to a loop in F'/ connected to {a, 0(a)}. Therefore, the corre- 
sponding path in Fi is a l-simple path and is a 0-palindrome beginning with a or 0(a) respectively and 
ending with 0(a) or a respectively. 

In the second case such path starts with an edge {e, 0(e)}. It is clear that the end of this path is again 
the same edge {e, 0(e)} as otherwise there would be a cycle in F". If the corresponding path in Fi starts 
with e, it must end with 0(e) as otherwise there would be a multiple edge (forming a cycle) between the 
vertices of F" connected by {e,0(e)}. Therefore, if such path starts with a, it must end with 0(a) and 
the proof is finished. 

n 

In what follows, to ease the notation, we will borrow some elements from the free group generated 
by A and use them in the following way. Let w be a finite word. By w~^ we denote an element whose 
concatenation with w produces the empty word, i.e., ww^^ — w^^w — s. The notation is justified by the 
fact that we will never concatenate w^^ with anything else than w. 
Proof of Theorem 2. 

(H: 

Suppose that u is not rich. According to Lemma 14 non-0-palindromic letters alternate. Proposition 12 
implies there exists a factor v G ^(u) and a factor r G ^(u) such that r begins with v and ends with 
0(w), with no other occurrences of v or 0(w), and r is not a 0-palindrome. Then r is of the form 

r = vwxsyQ(w)Q{v), 

where w and s are finite words and x,y ^ A, x j^ 0(2/)- As the language is closed under 0, we can 
see that v'wQ{y) G C{u). This means that the factor vw is right special and therefore {vw, Q{vw)} is a 
vertex in F(' ,. 

If r is a luuij-simple path then r forms a loop in Fi'^^i. Therefore according to Corollary 7 it is a 
0-palindrome - a contradiction. 

Suppose now that r is not a |ww|-simple path, i.e., contains other special factors of length \vw\. Denote 
by p, q two factors of r such that they are |ww|-simple paths and p is a prefix and q a suffix of r. Let p' 
denote the special suffix of length \vw\ of p and q' the special prefix of q of length \vw\. It is clear that 
vertices {vw, 0(i;w)} and {p' , Q(p')} are connected as well as {vw, <d{vw)} and {q' , ^{q')} are connected. 

If p' and q' coincide, i.e., r — pp' q, then as x ^ 0(y) we have a multiple edge in Fi^^i - a cycle in F]^^ 
and thus a contradiction. Up' and q' do not coincide there exists a path in Fj^^, leading from {p' , 0(p')} 
to {q' ,&{q')}, avoiding {vw,Q{vw)}, which together with the two paths connecting {vw,0{vw)} forms 
a cycle and we have again a contradiction with Corollary 7. 

(^): 

We say that a factor r is a realization of an edge in F" between vertices {v, 0(w)} and {w, Q{w)} if 

• r has a prefix v or 0(u) and a suffix w or 0(^7) OR 

• r has a prefix w or 0{w) and a suffix v or 0(w), 

and contains no interior occurrences of factors v, 0(u), w and Q{w). 

We will denote hy v ^ w a realization of an edge between vertices {v,Q{v)} and {w,Q{w)} which 
starts with v and ends with w. 
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First, we are going to show that for every pair of distinct vertices {v, 0(w)} and {w, Q{w)} there are 
exactly two reahzations of an edge between them and that they are Q images of each other. Let w ^ v and 
w ^ Q{v). Suppose without loss of generality there is a realization v^w. We will find two consecutive 
realizations, denoted ri and r2. See Figure 1. 



Figure 1. Two consecutive realizations u-^w. 

According to Corollary 11 occurrences of each factor alternate with occurrences of its O image. There- 
fore we can find occurrences of Q{v) and Q{w) between occurrences of ri and r2. Let us first find the 
leftmost occurrence (after ri) of G(u). As every factor beginning with v and ending with Q{v) is a 
0-palindrome, we can find a realization Q{w) ^Q{v). Denote this realization by qi. See Figure 2. Note 
that the occurrences of the factor w in ri and 0{w) in qi may coincide. 

0-palindroinc 



v~^w . . .Q{w) ~7 0{v) v-^w 

Figure 2. The leftmost occurrence of realization Q(w)—rQ(y). 

Let us now find the rightmost occurrence (before T'i) of 0(w). Analogously, we find a realization 
9(u')^0(u). Denote this realization by qi. See Figure 3. 

0-palindromc 



v~7W Q(w)^Q{v) . . .v^w 
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Figure 3. The rightmost occurrence of realization 8(w)^0(u). 

Suppose now that occurrences of q\ and qi are not equal. If they are not consecutive, we can find a 
realization of 0(w) —r 0(u), denoted by g^.? such that gj and qi are consecutive. Now, using the same 
arguments as for consecutive realizations r\ and r2, we can find a realization v—rw between gj. a-nd qi. 
This is a contradiction with r\ and ri being consecutive. Thus the occurrence of q\ equals the occurrence 
qi and q\ = qi. 0-palindromes depicted in Figure 2 and Figure 3 imply q\ = 0(ri) and qi = Q{ri), 
respectively. Therefore ri = ri and we deduce that every realization v~tW equals ri and every realization 
Q{v)^Q{w) equals 6(ri). 

So far we have shown that for every pair of distinct vertices {w, Q{v)} and {w, 0(w)} there are exactly 
two realizations of an edge between them and that they are Q images of each other. This implies that F" 
after removing loops is a tree. Corollary 11 claims that occurrences of w and Q{w) alternate, thus any 
realization between the same vertices {w, <d{w)} is of the form w^Q{w) or Q{w)^w. By Proposition 12, 
such realizations are 0-palindromes. These realizations are in fact loops in F". Using Corollary 7 these 
two parts together imply that the equality (3) holds. 

D 

5. Examples of 8-rich words 

In this section we will provide examples of 0-rich words. We first concentrate on a generalization of 
episturmian words, namely 0-episturmian words, mentioned earlier. The following definition will serve 
to reveal the structure of the language of a 0-episturmian word. Its main aim is to ease the notation. 



10 STEPAN STAROSTA 

Definition 15. Let u be a Q-episturmian word. Denote by (wk)i^^Q the sequence of all bispecial factors 
of u, ordered by increasing length and starting by wo = s with K possibly equal to +cxd. Furthermore let 
Zk denote the unique letter such that WkZk is a prefix of Wk+i- Denote by pk G A* the word such that 
Wk+i = Pk^{zk)wk. For all k E N and all a E A let rk.a denote the complete return word to Wk beginning 
with Wf^a, if it exists. 

For a letter a ^ 0(a), let ka denote the maximal index k < K , if it exists, such that Wk has non-Q- 
palindromic complete return words beginning with WkO and with WkO{a). If ka does not exist, we will 
consider it to be +cx}. 

Note that since there is at most one complete return word Wk having a prefix WkO, the definition of 
Tfc^a is correct. As the language of u is closed under O, rk.a exists if and only if r^Q/a) exists. Note also 
that fca > 1 for every letter a 7^ 8(a). 

Proposition 16. Let u be a 0-episturmian word. 

(1) If a — Q{a) and rk^a exists, then rk.a is a Q -palindrome; 

(2) ifa^eia), then 

a. if k < ka, then rk^a equals Wkawk, furthermore a is not contained in Wk or in any other 
complete return word to Wk except for rk^a', 

b. if k > ka, then at most one of the words rk.a o,nd rk^Q(a) exist. 

Proof. The proof will be done by induction on k. 

For fc = 0, we have wo ~ e and the claim holds trivially (for non-0-palindromic a we are in the case 
(2a)). 

Suppose the claim holds for k. We will prove it for fc + 1. We need to consider especially the case 
fc + 1 == fca + 1 for all a to be able to assume also (2b) . 

Case (1): 

Let a = 0(a). If a = z^ then rk+i^a — Pk^{zk)wkas for some word s. As there is at most one complete 
return word to Wk beginning with WkO, we have Wkos — WkaO{pk). Thus, rk+i^a = PkO.WkaO{pk) which 
is a 0-palindrome. 

li a ^ Zk suppose that rk+i.a is not a 0-palindrome. Therefore we can write rk+i.a ~ Pk^{zk)wkO,x 
and Wkox ~ yawkV — rk.aV with x,y,v G A* . By the assumption rk^a is a 0-palindrome which implies 
that V cannot start with Zk as rk+i^a would be a 0-palindrome. In other words we can find a factor 
having the form awkb with b ^ Zk. 

We can then find two different factors Q{zk)wka G jC{u.) and Q{b)wka G C{u). Thus WkO is left special 
which is a contradiction with WkZk being a second left special factor of the same length. Thus b = Zk and 
we have rk+i^a = PkQizk)'''k,aZkQiPk) which is a 0-palindrome. This concludes the proof of the case (1). 

Case (2): 

Let now a y^ Q{a). We distinguish the following three cases according to the value of ka. 

I. k + l<ka 

Suppose that a is contained in Wk+i. This means that WkOWk is a factor of Wk+i and so is 
WkO{a)wk. 

Suppose Zk ~ a. We have Wk+i ~ WkawkpwkO{a)wk where p is a 0-palindrome or Wk~^. 
Then the complete return word rk+i^a equals Wk+iawkpwkQ{a)wk which is a 8-palindrome and 
contradicts the assumption fc + 1 < ka. 

Suppose now Zk ^ a. We can then find a factor of Wk+i of the form bwkOWkC with b ^ Q{zk). 
(If b = 0(z/c), then we would have a shorter bispecial factor than Wk+i but longer than Wk.) Since 
Tk+i,a exists we can find a factor Q{zk)wka. Thus WkO is right special which contradicts WkZk being 
right special of the same length. 

We will now prove that rk+i.a is the only complete return word to Wk+i to contain a. Suppose 
that a is contained in rk+i^b with b ^ a. Since a is not contained in Wk+i, ffc+i,& is of the form 
Wk+ixwk+i where x G A* contains a. 
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We can find a factor of x equal to cwkd with c ^ O(zfc)- As rk+i,a exists there is a factor 
0(zfc)wfca which implies that WkCi is right special. Since especially a ^ Zk 'as. otherwise it would be 
contained in Wfe+i, we arrive at a contradiction. 

It remains to show that rk^i,a — Wk+iawk+i- Since fc+ 1 < fc^ we know from the definition of ka 
that both Tfe+i d and rj,^iQi^\ exist. We can see that r^^ia ~ Pk^izk)rk,aX with x € A* . If the first 
letter of x is Zk we are finished. Suppose it is not true and denote that letter by xq ^ z^. We have 
aWkXfj S 'C(u). As rk-i^iQ/^i) exists (from the definition of ka) we know that G(zj,)wj,0(a) G >C(u) 
which implies aw^Zk € 'C(u). This implies that awk is right special. As 0{zk) ^ a we can see that 
we have 2 right special factors of the same length and thus xq must be Zk- Finally as rk^a — Wkawk 
we have rk+i,a = PkG>izk)rk.aZkG>{Pk) = Wk+iawk+i- 
II. fc + 1 = A:„ + 1 

If Wk+i does not exist, then rk+i,a does not exist too. 

If Wk+i exists, suppose that both rk+i,a and rk+i^Q(a) exist and at least one of them is a Q- 
palindrome. We can suppose without loss of generality that rk^a is a 0-palindrome. We can then 
find a factor Q{a)wkZk- ?'fe+i.e(a) contains a factor Q{a)wkb with 6 ^ z^; as otherwise we would 
have two complete return words rk+i^a- The factor Q(a)wk is therefore right special which implies 
Zk — a. As 0{a)wk&{a) G >C(u) we have also awka G C{u) and awk is not right special. Since WkO, 
can only be extended as WkaWk we arrive to a contradiction - u = pwkawkawka . . . which implies 
u is not recurrent (6(a) occurs only in the word p). 
III. k + l>ka + l 

It suffices to see that if rk^a does not exist then rk+i^a does not exist too. 

n 

Corollary 17. For all letters a ^ 6(a) the value of ka is finite. 

Proof. If u is periodic, then K is finite and so is ka. For u aperiodic if ka = +oo then we stay in the case 
(2a) for all k. In other words all bispecial factors do not contain a which is clearly impossible. D 

From the above corollary we can deduce that if 6 is different from the reversal mapping, then there 
exists k such that AC{\wk\) < #.4 — 1, i.e., the growth of factor complexity is not constant. 
The following corollary will serve to derive the criterion for richness of 6-episturmian words. 

Corollary 18. Let k > ka and rk^a is a Q -palindrome. Then either rk+i,a does not exist or is a 6- 
palindrome. 

Proof. Suppose that rk+i^a exists and is not a 6-palindrome. Then rk+i.a begins by PkQ{zk)fk,ab where 
Zk ^ h <E A. As Q{a)wkb G C{u) so is Q{b)wka G C{u). Together with Q{zk)wka G jC{u) we see that 
WkO is left special, i.e., Zk = a. This is impossible as we would have rk+i.a = Pk^{o-)wkaO{pk) — a 
6-palindrome. D 

Corollary 19. Let u be a Q-episturniian word. Let Ik = \wk\. If there exists an index fco such that 
Peilko) + 'Peiho + 1) = ^Ciho) + 2, then for all n > h. 

Vein) +Ve{n + l) = AC(n) + 2. 

Proof. First note that as u is 6-episturmian, F" has at most one vertex for all n. 

According to Corollary 7, the graph T'{ consists of one vertex and Ve{lko) + 'Peiha + 1) ^ 1 edges 
which are loops mapped onto themselves. Therefore all complete return words to Wkg are 6-palindromes. 
Thus, using Corollary 18, for all k > ko, the graph F" consists of one vertex with loops mapped onto 
themselves. Thus using again Corollary 7, we see that Ve{lk) + Veilk + 1) = AC{lk) + 2. 

We will prove that for all k > ko, for all n such that Ik-i < n < Ik, the graph F" is of the same form 
as F" (one vertex with loops mapped onto themselves) and thus according to Corollary 7 the equation 
Pq (n) 4- Pe ("■ + !) = AC(ri) + 2 holds. It suffices to see that every n-simple path is in fact a 6-palindromic 
factor of a (/fc)-simple path or Wk. 

It remains to see the case of finite value of K. Then for all n > K, there is no special factor of length 
n. According to Femma 6, we have 2 — Ve{n) + Pe{n + 1) = AC{n) + 2. 
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n 

The following corollary stems from combining the previous claim with Theorem 2. It gives a criterion 
to see whether a 0-episturmian word is 0-rich or not. 

Corollary 20. 0-episturmian word is Q-rich if and only if'Pe{l) + T'elS) = AC(1) + 1. 

5.1. Examples. This section contains examples of infinite words with properties mentioned above. 

Example 21. Let tt denote a morphism {0,1}* n> {a,a',c}*: 

M. aa' 



1 H> aa'c 
Denote Q an involutory antimorphism defined as 

: a I— > a', a' i— > a, c i~^ c. 

Then the infinite word u = 7r(v), where v is a Sturmian word, has the following properties: 

(1) is 0-episturmian, 

(2) satisfies the equality Vein) + Ve{n + 1) = AC{n) + 2 for n > 1, 

(3) is 0-rich. 

As V is Sturmian, it is clear that u is aperiodic. Furthermore if we enumerate /32(u) we see that 
AC(1) = 1. These two facts imply that for n > 1 we have AC(n) = 1. As the language £(v) is closed 
under reversal, the language C{u.) is closed under Q. 

To prove the rest we can either apply directly Corollary 20 or enumerate 0-palindromic complexity. 

In the following examples the mapping is considered the same as in Example 21. 
Example 22. Let n be the following morphism 

TT : H> a'cacc, 1 M> a'cac. 

We set u — 7r(v) where v is a Sturmian word over {0, 1}. Then u is 0-episturmian and not 0-rich. 

The proof is similar as in Example 21. From v we can deduce that the word u is aperiodic, uniformly 
recurrent and contains infinitely many 0-palindromes. The last two properties imply that £(u) is closed 
under 0. As AC(2) = 1 and there is one special factor of length 1 - the factor c - it is clear that u is 
0-episturmian. 

u is not 0-rich: factors ca'c and cac do not have a unioccurrent 0-palindromic suffix. 

Next example is a 0-rich word which is not 0-episturmian. 

Example 23. Set tt as follows 

TT : H- > aa' , 1 H> acca' . 

Let V be a Sturmian word over {0, 1}. Then the word 7r(v) is 0-ricli and is not 0-episturniian. The proof 
is left to the reader. 

The following two examples concern periodic infinite words. 

Example 24. The word (caca') is periodic but not 0-rich. The factors ca'c and cac do not have a 
unioccurrent 0-palindromic suffix as before. 

Example 25. The word (ccaa') is periodic and 0-rich. It follows from the fact that we can find a 
unioccurrent 0-palindromic suffix for each prefix of this word. 
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6. Final remarks 

A wider class of infinite words related to 0-palindromes has been studied in [7]. The class is constructed 
using the right Q -palindromic closure operator. To any word w in A* it associates the word w® which is 
the shortest G-palindromic word having w as prefix. It is clear that if s denotes the longest G-palindromic 
suffix of w then if we denote w — ps we have w® — psO{p). 

Let uq be a fixed finite word called seed. Let ^I* : ^* — > ^* be a map defined as follows 

*(e) = Mo, 
^i{ux) == (vI'(m)x)® for u e A* and x e A. 
For a given infinite word t we can define ^(i) as 

*(t) = lim * (pref„(i)) 

where pref„ stands for the prefix of length n. The word t is called the directive word of ^(t) and is 
denoted A(^(t)). The word 'i'(t) is called Q-standard word with seed if uq 7^ e otherwise it is called 
just 0-standard. 0-episturinian words are included in 0-standard words with seed (exact condition and 
details are given in [7]). 

As the definition of 0-standard word is constructive, a rough and direct condition to decide whether 
a 9-standard word is 9-rich is easy to derive. If the word ^ (prcf„ (t))t„ is 0-rich for all n > (with 
t = totit2 . . .), then *(t) is S-rich. 

The following proposition connects the class of 0-standard words with seed with the class of 0-rich 
words. 

Proposition 26 (Proposition 4.7, [7]). Let u be a Q-standard word with seed. Then there exists an 
integer M such that any prefix p of u of length greater than M has a unioccurrent Q-palindromic suffix. 

The integer M is explained in detail in the paper. It is clear that if M = 1, then u is 0-rich. However 
M ~ 1 implies some restrictions on Q and a criterion whether a 0-standard word with seed is 0-rich 
remains an open question. 
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